# High-resolution image generation

FLUX.1 Dev GGUF
Other
FLUX.1 [dev] is a rectified flow transformer with 12 billion parameters, capable of generating high-quality images based on text descriptions, providing powerful image generation capabilities for developers and creative professionals.
Text-to-Image English
unsloth
371
1
Simpletuner Lora
Other
simpletuner-lora is a PEFT LoRA adapter derived from FLUX.1-dev for text-to-image and image-to-image generation.
Text-to-Image
binarydaddy
249
0
Simpletuner Lora
Other
LyCORIS adapter based on Stable Diffusion 3.5 Medium, specializing in photorealistic image generation
Image Generation
hmwhwm
209
0
Stylloha
Other
A LyCORIS adapter based on FLUX.1-dev, focusing on text-to-image and image-to-image tasks, supporting multiple resolution outputs.
Image Generation
quzo
62
0
Flux Lora Training
Other
This is a standard PEFT LoRA derivative model based on FLUX.1-dev, focusing on text-to-image and image-to-image generation tasks.
Image Generation
Forezeztgump
94
0
Hidream5m Photo 1mp Prodigy
Other
LyCORIS adapter based on HiDream-I1-Full, focusing on high-quality image generation
Image Generation
bghira
100
0
Auraflow V0.3
Apache-2.0
AuraFlow v0.3 is a fully open-source flow-based text-to-image generation model that supports multiple aspect ratios, with resolutions up to 1536 pixels.
Text-to-Image
terminusresearch
80
1
Reddy V4
Other
Standard PEFT LoRA model based on FLUX.1-dev, specializing in generating high-quality female character images
Image Generation
Unmapped2895
59
0
Ben Brand LoRA
Other
A PEFT LoRA model trained on FLUX.1-dev for text-to-image generation in a specific artistic style.
Image Generation
davidrd123
253
1
Dc Ae F32c32 Sana 1.1 Diffusers
MIT
DC-AE is a novel autoencoder architecture designed to accelerate high-resolution diffusion models. It maintains reconstruction quality at high spatial compression ratios through residual autoencoding and decoupled high-resolution adaptation techniques.
Image Generation
mit-han-lab
1,127
4
Dc Ae F32c32 Sana 1.1
DC-AE is a novel autoencoder architecture designed to accelerate high-resolution diffusion models, addressing reconstruction accuracy issues under high compression ratios
Image Generation
mit-han-lab
18.17k
7
Sana 600M 1024px
Sana is an efficient text-to-image framework capable of generating images with resolutions up to 4096×4096, featuring rapid synthesis of high-resolution, high-quality images.
Text-to-Image Supports Multiple Languages
Efficient-Large-Model
285
19
Sana 1600M 1024px MultiLing
Sana is an efficient text-to-image framework capable of generating images with resolutions up to 4096×4096, supporting multilingual input.
Text-to-Image Supports Multiple Languages
Efficient-Large-Model
111
24
Flux GArt LoRA
Openrail
A text-to-image diffusion model fine-tuned based on FLUX.1-dev, specializing in GArt style image generation
Image Generation
prithivMLmods
69
4
Sana 1600M 1024px
Sana is an efficient text-to-image framework capable of generating images up to 4096×4096 resolution, deployable on laptop GPUs.
Image Generation Supports Multiple Languages
Efficient-Large-Model
2,327
206
Ebook Creative Cover Flux LoRA
Openrail
A text-to-image model based on LoRA technology, specifically designed for generating e-book cover designs
Image Generation
prithivMLmods
173
21
Mlx Stable Diffusion 3.5 Large
Other
An MLX port of Stable Diffusion 3.5 Large, optimized for text-to-image generation on Apple silicon
Image Generation English
argmaxinc
502
7
SD3.5 LoRA Futuristic Bzonze Colored
Other
A LoRA fine-tuned model based on Stable Diffusion 3.5, specialized in generating images with a futuristic bronze color style.
Image Generation
Shakker-Labs
97
27
Meissonic
Apache-2.0
Meissonic is a non-autoregressive masked image modeling text-to-image model capable of generating high-resolution images, specifically designed to run on consumer-grade GPUs.
Text-to-Image English
MeissonFlow
47
103
Cogview3 Plus 3B
Apache-2.0
CogView3-Plus-3B is the DiT version of CogView3, supporting text-to-image generation at resolutions from 512 to 2048 pixels.
Text-to-Image English
THUDM
385
31
Illustrious Xl V01 Sdxl
Other
An early-release model based on Stable Diffusion XL, focused on text-to-image generation of anime-style illustrations
Image Generation English
John6666
135
3
Mlx Stable Diffusion 3 Medium
Other
MLX implementation of Stable Diffusion 3 Medium, focused on text-to-image generation
Image Generation English
argmaxinc
238
2
Flux Controlnet Hed V3
Other
HED (holistically-nested edge detection) ControlNet checkpoint for the FLUX.1-dev model, for image generation tasks
Image Generation English
XLabs-AI
10.17k
65
Flux Controlnet Depth V3
Other
A depth ControlNet checkpoint for the FLUX.1-dev model by Black Forest Labs, suitable for image generation tasks at 1024x1024 resolution.
Image Generation English
XLabs-AI
9,649
112
Flux Controlnet Canny V3
Other
Canny edge ControlNet checkpoint for the FLUX.1-dev model, suitable for 1024x1024 resolution image generation
Image Generation English
XLabs-AI
5,419
123
Auraflow V0.3
Apache-2.0
AuraFlow v0.3 is a fully open-source flow-based text-to-image generation model that supports multiple aspect ratios, with resolutions up to 1536 pixels.
Text-to-Image
fal
1,224
127
FLUX.1 Dev IP Adapter
Other
IP-Adapter for the FLUX.1-dev model, which lets a reference image act as a prompt, much like text, in text-to-image generation tasks
Text-to-Image English
InstantX
8,361
279
Flux Controlnet Collections
Other
The FLUX.1-dev ControlNet Collection provides three pre-trained ControlNet models (Canny edge, HED edge, and MiDaS depth), optimized for 1024x1024 resolution image generation.
Image Generation English
XLabs-AI
22.35k
483
Zelda Lora
Openrail
Stable Diffusion XL (SDXL) 1.0 is a powerful text-to-image model capable of generating high-quality images from text descriptions.
Image Generation TensorBoard
nroggendorff
31
1
Lumina Next SFT Diffusers
Apache-2.0
Lumina-Next-SFT is a 2-billion-parameter Next-DiT model that uses Gemma-2B as the text encoder and is enhanced through high-quality supervised fine-tuning (SFT) for text-to-image generation.
Text-to-Image
Alpha-VLLM
8,442
25
Pixart 900m 1024 Ft V0.6
Openrail
A fully fine-tuned image generation model based on ptx0/pixart-900m-1024-ft-large, specializing in high-quality image generation
Image Generation
terminusresearch
4,111
24
Colorfulxl
Colorful XL is a Stable Diffusion-based text-to-image generation model, capable of producing high-quality and diverse images from text descriptions.
Text-to-Image English
recoilme
5,810
11
Kohaku XL Epsilon Rev2
Other
A text-to-image generation model based on Kohaku XL Epsilon rev1, tuned on selected artists' works and on images from specific series and games
Image Generation English
KBlueLeaf
46
24
Controlnet Canny Sdxl 1.0
Apache-2.0
A ControlNet model for SDXL 1.0 that generates high-resolution images with visual quality comparable to Midjourney, using Canny edge maps for precise control.
Image Generation
xinsir
25.79k
183
Terminus Xl Velocity V2
Openrail
A full-rank fine-tuned text-to-image generation model based on terminus-xl-velocity-v1, supporting multiple resolution outputs
Image Generation
bghira
875
8
Pixart Sigma XL 2 1024 MS
PixArt-Σ is a latent diffusion model based on the Transformer architecture, capable of generating high-resolution images (up to 4K) directly from text prompts.
Image Generation
PixArt-alpha
7,283
87
Envy Arcane Xl 01
Other
A LoRA model fine-tuned based on Stable Diffusion XL 1.0, specializing in generating arcane magic-style fantasy concept art images
Image Generation
e-n-v-y
56
4
SD15 768
Openrail
An image generation model fine-tuned from Stable Diffusion 1.5, optimized for stable high-resolution output and supporting multiple aspect ratios
Text-to-Image English
panopstor
43
2
Deliberate2
Deliberate 2 is a text-to-image generation model that supports generating images in styles such as general, anime, and art.
Image Generation
Yntec
998
8
Futaall V8 VAE Diffusers
Other
A Stable Diffusion-based text-to-image generation model, capable of producing high-quality images from text descriptions.
Image Generation
digiplay
2,916
3
© 2025 AIbase